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Abstract 

Log-normal continuous random cascades form a class of multifractal processes that has already been successfully used in various 
fields. Several statistical issues related to this model are studied. We first make a quick but extensive review of their main properties 
and show that most of these properties can be analytically studied. We then develop an approximation theory of these processes 
' in the limit of small intermittency <C 1, i.e., when the degree of multifractality is small. This allows us to prove that the 

5-H ' probability distributions associated with these processes possess some very simple aggregation properties accross time scales. Such 

Oh! a control of the process properties at different time scales, allows us to address the problem of parameter estimation. We show 

. that one has to distinguish two different asymptotic regimes: the first one, referred to as the "low frequency regime", corresponds 

to taking a sample whose overall size increases whereas the second one, referred to as the "high frequency regime", corresponds 
to sampling the process at an increasing sampling rate. We show that, the first regime leads to convergent estimators whereas, 
in the high frequency regime, the situation is much more intricate : only the intermittency coefficient can be estimated using 
a consistent estimator. However, we show that, in practical situations, one can detect the nature of the asymptotic regime (low 
frequency versus high frequency) and consequently decide whether the estimations of the other parameters are reliable or not. We 
finally illustrate how both our results on parameter estimation and on aggregation properties, allow one to successfully use these 
f-H , models for modelization and prediction of financial time series. 
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Index Terms 

Scaling Phenomena, Self-similarity, Multifractal scaling, Intermittency, Parameter estimation. Financial time series. 

I. Introduction 
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I Ata displaying multi-scaling behavior are observed in various fields of applied and fundamental sciences: the velocity 
field of fully developed turbulent flows [1], financial time-series [2], [3], the telecommunication traffic load in high 



speed networks [4], medical time-series [5], [6], geological shapes [7] are only few of numerous examples. The paradigm 
of multifractal processes are multiplicative cascades originally introduced by the russian school [8] for modelling the energy 
cascade in fully developed turbulence and further studied by Mandelbrot [9], [10]. Very recently, continuous versions of these 
' processes have been defined: they share exact multifractal scaling with discrete cascades but they display continuous scaling and 
possess stationary increments [11]-[14]. Despite the huge number of mathematical studies devoted to discrete (e.g., [15]-[19]) 
or continuous random cascades [12], [14], [20]-[22], only very few works considered standard statistical problems associated 
. ^ with these processes (see however [23]-[27]). Our goal in this paper is to adress several statistical issues related to multifractal 
processes. 

^ The self-similarity of a process X{t) Q can be characterized by the power-law behavior of the g-order moments of its 
■ - - increments as functions of the scale 

e R, E [|X(t)|^] ~ Cqt^^('i), Vt < T, (1) 

where T is referred to as the "integral scale", it actually corresponds to a decorrelation scale. In the case the so-obtained 
"scaling exponents" (^x {q) are not depending linearly on q but is a concave function of q, the process is said to be a multifractal 
process. Since scaling of moments of different orders do not behave homogeously as the time scale is changed, the probability 
distribution of the increments of a multifractal process strongly depends on the scale of the increments. For random cascade 
models, one can show that the scaling exponent, as a function of q, corresponds to the cumulant generating function of a 
log-infinitely divisible law W, 

Cx(g) =inE[iy«]. (2) 

Given the infinitely divisible law W, the continuous cascade process is entirely defined as soon as its variance (i.e., a 
simple multiplicative factor) is fixed, as well as its integral scale T (i.e., the "size" of the cascade, or the decorrelation scale). 
In this paper, we will exclusively focus on continuous cascades with log-normal scaling exponents. 

E. Bacry and A. Kozhemyak are in the Centre de Mathematiques Appliquees, Ecole Polytechnique, 91128 Palaiseau, France. 
J.F. Muzy is in the Laboratoire Sciences Pour rEnvironnement, CNRS, Universite de Corse, UMR 6134, 20250 Corte, France. 

'We consider exclusively processes X(t) with stationary increments and such that X{0) = 0. In that respect, Vt', X{t' + t) — X{t' ), has the same 
distribution than X(t). 
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Among the whole class of log infinitely divisible cascade models, log normal cascades have the advantage of being fully 
determined by a single parameter A^, corresponding basically to the variance of In W. Thus, as shown by Eq. this parameter 
rules the non linearity of the scaling exponents (i.e., the "degree" of multifractality of the process). It is referred to as the 
the intermittency coefficient. Moreover, log-normal continuous cascades have a very simple alternative construction and, as 
we will see in the next section of this paper, most of their properties can be expressed under closed formulae. Though, they 
correspond to a particular log-infinitely divisible law, they are rich enough to raise challenging statistical questions such as 
estimator convergence in the context of long-memory correlation. Our approach reUes upon an approximation that allows us to 
precisely control the aggregation properties of the process. In fact, we show that in the small intermittency limit (A^ <C 1), a 
log-normal continuous cascade has increments that are "close" to be log-normal, at each scale, in a sense that will be precisely 
defined in the sequel. This approximation framework allows us to develop a method to estimate the process parameters. In that 
context, we are lead to introduce two distinct situations for asymptotic regimes: the first one, referred to as the "low frequency 
regime", corresponds to the classical notion of infinite observation scale at a fixed sampling rate while the second, referred to 
as the "high frequency regime", corresponds to sampling the process over a fixed observation scale at an increasing sampling 
rate. More precisely, if r is the sampling rate and L is the observation scale, the observed samples corresponds to the values 

{X(nr)}„g[o,Af[, where N = ^. (3) 

Both asymptotic regimes corresponds to ^ +oo, however, whereas the low-frequency regime corresponds to r fixed and 
L — > +00, the high-frequency regime corresponds to L fixed and r — > 0. From an experimental point of view, the first regime 
corresponds to the case where ^ 3> 1 and ^ ~ 1 whereas the second regime corresponds to the case where ^ ~ 1 and 
— ^ 1. We show that the properties of the parameter estimators are fundamentally different depending on the nature of the 
asymptotic regime (and allow, by the way, to test what the effective nature of the regime is). In the last section, we apply all 
these results on parameter estimation for the calibration of a multifractal model to account for volatility dynamics in financial 
time series. Moreover, making extensive use of the aggregation properties, we show that the so-obtained model provides highly 
performant methods to forecast risk. 

The paper is organized as follows: in section HI] we recall the main definition of log-normal cascades at the heart of this 
study and we state its main properties. We study both the case of the Multifractal Random Measure (MRM), a non-decreasing 
process, and the case of the Multifractal Random Walk (MRW), a Brownian motion subordinated by the MRM. The aggregation 
properties of the model are discussed extensively in section |lll] where we develop our small intermittency approximation theory. 
In this section, we first introduce a Gaussian process : the "renormalized magnitude" U{t) that will be involved in all the 
following approximations. We show that, in the case A^ ^ 1, in some sense to be defined, the variations of the MRM or of 
the MRW are closely related to those of il{t). Whereas subsection IIII-CI states a convergence theorem of the logarithm of the 
MRM towards in the hmit A'^ 0, the other subsections establish different MRM/MRW moment approximation theorems 
(when A^ ^ 1) as functions of ri(t) moments. In section |IV] we show how these approximations can be used to calibrate the 
model. The estimation issues are discussed within both low-frequency and high-frequency asymptotic regimes. We first show, 
in Section HV-AI that, in the low-frequency regime, the "Generalized Moments Method" (GMM) leads to convergent estimators 
whereas, as shown in Section IIV-BI in the high-frequency regime, the situation is more intricate. Indeed, in this regime, the 
integral scale T is shown to be a "fake" parameter and cannot be estimated. However, in experimental situation, the order 
of the GMM estimation of the fake parameter T is proved to give some hints about the nature of the asymptotic regime and 
consequently about the reliability of the estimation of T and a^. In Section HV-CI we exhibit a GMM type estimator of A^ 
that is proved to be consistent. Numerical experiments illustrate all the estimation results. In Section [V] we apply our results 
concerning parameter estimation tp the calibration of a model for asset price fluctuations in financial markets. We then show 
how the aggregation properties can be successfully used to perform conditional risk forecasting. Conclusions and prospects 
are provided in section IVTl while Appendices contain additional technical results. 

II. Definitions and main properties of log-normal continuous cascade models 

In this section we will first focus on the definition of Multifractal Random Measures (MRM) originially introduced in 
[14]. We will denote M[tQ,ti] the measure of the interval [to,ii] and M{t) the non decreasing process M{t) — M[0,t]. We 
propose two different approaches to define log-normal MRM. The first one relies upon the construction of some temporal 
Gaussian process which covariance mimics the observed "ultrametric" covariance of discrete Mandelbrot cascades while the 
second one involves random measures in a 2D half-plane. The direct construction has the advantage of being simpler and easy 
to implement while the second construction can be easliy extended to other infinitely divisible laws than the Gaussian law. 
Multifractal Random Walks (MRW) can be easily obtained from MRM by compounding a self-similar stochastic process with 
M(t). 



A. Direct definition 

Let the measure Mij'{dt) be defined by 



(4) 
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in the sense that for all Lebesgue measurable set / one has, Mi^t{I) = e^^'-^^'^dt. The process uji^rit) is Gaussian and 
stationary and is defined by its mean and covariance function : 

E[c^i,TW] = -A2 (^ln(^y^ +l) (5) 

and 

f^Mln(?) + l-f), ifO<r</, 
pi,TiT)=Cov[ui,T{t),ui,T{t + T)]=lx''\n{^), if I <T<T, (6) 

[o, if T < T < +00, 

where the parameters T is the integral scale and is the intermittency coefficient. Note that the fact that expression Q 
represents of definite positive function is proven in the next section. Using Kahane Chaos theory [28], on can prove that the 
weak limit 

Mridt) = lim Miridt), (7) 

exists and is non trivial as long as < 1/2. 

Let us remark that the process {uji^T{t)}t can be represented as a stochastic integral of a Kernel against the Wiener White 
noise dB{t), like Mandelbrot- Van Ness fractional Brownian motion representation: 

lui^tH) = -E [iOi,T{t)] + f KuT{t - u)dB{u), (8) 

where the kernel Ki,t (that can be chosen to be causal) satisfies the convolution equation: 

Ki^T * Ki^t) ^ Cov[LOi{0),u;i{t)]. (9) 

A simple Fourier transform of this equation together with expression (|6]l allows one to show that the process uji^t can be seen 
as a kind of fractional Brownian motion in the marginal limit H ^ 0. Indeed, the kernel Ki,T{t) in previous equation behaves, 
in the range I <^t-^T, like: 

Ki{t)-^^- (10) 
For this reason, as emphasized in ref. [13], the MRM measure can be loosely defined as the exponential of an 1// noise. 

B. Alternative definition : continuous cascades 

The previous construction is hard to extend to other laws than the Gaussian law. A more general construction that allows 
one to build continuous cascades with arbitrary log-infinitely divisible statistics has been proposed by Bacry and Muzy [14]. 
It amounts in building the process uji T{t) from a 2d representation. 

We distribute a non centered gaussian white noise V of variance on the half plane {{t,l);t eM.,1 E M+*} using the 
density measure ij.{dt,dl) = l^^dtdl. Consequently, for any measurable set A of the half-plane, ^^{A) is a gaussian random 
variable whose Laplace transform is of the form 



E 



If we choose the mean of the white noise V such that for any A G 5+, one has E[7'(A)] — — Var [7^(-4)] — —2\^p{A), then 

il>{q) = 2A\2 - 2\\. (12) 
Then if we define, for all I and T such that Q <l <T, the cone-like domain: Ai^rit) as: 

Ai,T{t)^[{t\l')-l<l',\t' -t\<]^rmn{l\T)], (13) 
the gaussian process oji^T{t) defined by Eqs (|5]l and (|6]l has the following representation (see fig. [l) 

^i.rit) = l'PiAi,Tit)), (14) 
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Fig. 1. Cone-like domains Ai.riti), Ai^Ti't2) et Ai,T{'ti,t2) used in the definition of continuous cascades (see definition [T3}. The parameter 
T is the integral scale and the parameter I is the small scale cut-off. The limit MRM is obtained in the limit I 0. 

C. Properties 

Many properties of the MRM measure involves the following concave function 

Cm(<7) = q - ^{q) - (1 + 2\^)q - 2\\^ , (15) 

which satisfies Cm(0) = and Cm(1) — 1- The non degeneracy condition < 1/2, can be rewritten in terms of ^Aiiq) '■ 
Proposition 1 (Non degeneracy of M [14]): 

< i 4=» Cm(1) >0=> M is non degenerated and V<, E [M[0, t]] = t (16) 

The limit measure M possesses exact scale invariance properties that are directly resulting from the invariance of the function 
Pi^t{t) as respect to time dilation. Indeed, the covariance function pi^t{t) in Eq. (|6]l satisfies the following invariance properties 

(i) Ps1.st{st) = Pi.t{t), Vs > 0, 

(ii) pi,t[st) = pi,st{t) - In A, Vt < sT, s G [0, 1], 

(iii) PsiAst) = Pi,TiT) - In A, Vt < T, s e [0, 1], 

It follows that the MRM measure AI satisfies 3 scale invariance properties 
Proposition 2 (Scale invariance properties [14]): 

(i) Global scale-invariance 

{MsT[0,st]}t = s{MT[0,t]}t, Vs e R+ (17) 

(ii) Integral scale invariance property 

{MT[0,t]}o<t<sT - Ws{M,T[0,t]}o<t<sT, Vs e [0, 1] (18) 

(iii) Stochastic scale-invariance property 

{MT[0,st]}o<t<T = Ws{MT[0,t]}o<t<T, Vs e [0, 1] (19) 

with, in the last two equations, Ws = se^% where is a gaussian variable independant of M and defined by E [fig] = 
-Var [fls] /2^2X^ Ins 

From (iii) one can easily deduce that q-order moments verify an exact scale invariance property. Actually, one can show that 
Proposition 3 (Finiteness of positive order moments [14]): Let A^ < 1/2. Let q > 0. If Cm(9) > 1 then E [A/tP, t]''] < 
+00. Conversely, let g > 1, if Mr 7^ then E [Mt[0, i]'] < +00 CmC?) > 1- 
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_ 2„^2n(»-l)A^ "tT r(l - 2(fc + l)A^)r(l - 2k\^f 



Moreover, a straightforward computation shows that 

in s 

where Ws is defined in Proposition |2iii) One thus gets 

Proposition 4 {Exact scale invariance of q- order moments [14]): Let < 1/2. 

VqeR, E[MT[0,t]'i] = Kqt'^^^(^\ Vt < T, (21) 

where the prefactor Kg has an analytic formula in the case q = n E N* : 

n-l 

^_^r(2-2(p + fc-l)A2)r(l-2A2)- 

Moreover one can prove that all the negative order moments exist 

Proposition 5 (Finiteness of negative order moments (condition (C4) in [21 ])): Let A^ < 1/2. Then, \fq < we have 

E[AfT[0,i]«] < +00. 

D. The Multifractal Random Walk model 

As said previously, a large class of multifractal stochastic processes can be associated with a given MRM. The simplest way 
is probably the approach initiated by Mandelbrot and Taylor [29] that consists in compounding a self-similar stochastic process 
which increments are stationnary with the non decreasing function AfT[0, t], where Mt is a MRM as build in previous section. 
Another approach, inspired from econometrics, is to consider the measure Mxidt) as a stochastic variance associated with a 
Brownian motion [13], [30]. In this paper, for the sake of simplicity and concision, we will exclusively consider processes 
with stationnary and uncorrected increment constructed from the standard Brownian motion. We define a Multifractal Random 
Walk (MRW) as follows: Let Mt be a (log-normal) MRM and consider B{t) a Brownian motior0 independent of Mt- The 
MRW Xrit) is simply defined as, for all t > 0: 



XT{t)=Bi^MT[Q,t]j (23) 

An alternative construction is obtained by considering the stochastic integral of the measure M; t as respect to Wiener 
measure dB{u) (independent of Mi,t) and then take the (weak) limit I 0; 

t 

XtH) = lim / e'^'-^(")dB(u). (24) 
/-^o+ J 



Let us note that the equivalence between these two definitions is proven in [14] 

The properties of the MRW directly result from those of the MRM and the self-similarity of Brownian motion. 
Proposition 6 (Scale invariance properties [14]): 



(i) Global scale-invariance 

(ii) Integral scale invariance property 



{XsTist)}t = s{XT{t)}t, Vs e R+ (25) 



{XT{t)}o<t<sT - Ws{XsT{t)}o<t<sT, Vs G [0, 1] (26) 

(iii) Stochastic scale-invariance property 

{XT{st)}o<t<T = W,{XTmo<t<T, Vs G [0, 1] (27) 

with, in the last two equations, Ws — s^/^e^^/^, where fls is a gaussian variable independant of M and defined by E [fig] = 
-Var[f7^]/2 = 2A2 Ins 

Proposition 7 (Finiteness of positive order moments [14]): Let A^ < 1/2. Let 

Cx(g)-CM(|) = f(l + 2A2)-^g^ (28) 
Let g > 0. If Cx{q) > 1 then E [Xrit)''] < +00. Conversely, let <? > 1, if E [XT{t)''] < +00 Cx{q) > 1- 

"Note that a simple way to introduce long-range correlations in the MRW model would be to replace the Brownian motion B{t) by a fractional Brownian 
motion Bnit)- 
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A straightforward computation shows that 

In s 

where Wg is defined in Proposition |6fiii). One thus gets 

Proposition 8 (Exact scale invariance of q-order moments [14]): Let < 1/2. 

Vq e R, E [XT(t)^] = Kqt^>^(^\ Vt < T, (30) 

where the prefactor Kq has an analytic formula in the case q — n E : K„ = {2n — 1)!! Kn, where Kn is given by (l22l i. 

Proposition 9 (Finiteness of negative order moments (condition (C4) in [21 ])): Let < 1/2. Then, Vq < we have 
E[XtW«] < +0O. 



E. Discrete time representation of a MRW - Monte Carlo simulation 

Notation 1: For the sake of simplicity, in the following, if Y{t) is a sochastic process, we will use the notation 

drY{t)=Y{t)-Y(t-T). (31) 

Moreover we recall that if M{dt) is a measure, M{t) refers to the non decreasing process 

M{t) = M[0,t], (32) 

and, consequently 

SrM{t) = M{t) - M{t -t)= M[t - T, t]. (33) 

Let fix r > 0. We want to simulate the discrete time process {XT{nT)}n- Approximated Monte Carlo simulation of this 
discrete time process can be obtained using Eqs (|4|i, (|5]l and (|6]l. One first fixes / small enough (I — will be sufficient 
for the purpose of this paper) such that j is an integer. The Gaussian stationary discrete time process {u!i^T(nT)}n can be 
simulated using the analytical formulae of its mean (|5]) and of its autocovariance (|6]l. 

Thus one can easily simulate the measure Mi xidt) that is uniform on each interval of the form [kl, (k + 1)1] with the 
density e^^'.^C^'^. For n > 0, one has 

n-l 

Mi^rinl) = A/,,t[0, nl] = ^ e^'^i.T{ki)i_ (34) 

fe=0 

From these simulations, one can easily simulate the process {Xi T{t)}t>o which is linear on each interval of the form [kl, {k + 
1)1], and which satisfies 

n 

Xi^rinl) = J2 4k]Vle'^'-'^^''\ (35) 
fc=i 

where e[k] is a gaussian white noise which is independant of M. The convergence of the linear- wise process Mi,T{t) towards 
M{t) when I 0, and consequently the convergence of the linear-wise process Xi^rit) towards Xrit), are proved in [14]. 
Thus, simulations of the discrete-time process {Xi^T{m')}n can be seen as a good approximations of simulations of the 
discrete-time process {Xt(?it)}„. 



III. Aggregation properties 

A. Introduction 

One of the nice features of standard Brownian motion is its stability as respect to time aggregation: At each scale, the 
increment probability distributions remain Gaussian. Propositions |2iii) and lUiii) state that both the log-normal MRM and 
MRW processes have stochastic scale-invariance property. This means that, in some sense, they possess stable properties when 
changing the time scale. However, this property is of poor practical interest because it does not provide the probability law 
at a given time scale r but simply indicates how this law changes as r varies. In the log-normal continuous cascade models, 
the multifractality, i.e., the non-linearity of the moment scaling exponent Ca/(9), is fully characterized by the intermittency 
coefficient A^. Empirically this exponent is often found to be close to zero: For instance, the commonly reported value of A^ 
for energy dissipation field Eulerian Turbulence and for the volatility fluctuations associated with financial asset returns are 
respectively A^ ~ 0.2 and A^ ~ 0.02. It is therefore natural to study the properties of the log-normal MRM measure in the 
limit A^ <C 1. In this section, we will see how, in this regime, the law of the MRM can be well approximated by the law of 
an explicit log normal process based on the so-called (normal) renormalized magnitude process il{t). 

In this section, we show that, in this regime, the variations of the MRM or of the MRW are closely related to those of an 
explicit log normal process based on the so-called (normal) renormalized magnitude process n{t). Whereas subsection IIII-CI 
states a convergence theorem of the logarithm of the MRM towards in the limit A^ 0, the other subsections establish 
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different MRM/MRW moment approximation theorems as functions of il{t) moments. All along these sections, the moment 
approximation will be made on the following criterium 

Notation 2: Let {X\{t)}t and {Y\{t)}t be two processes that depend on the parameter A^. Let Mx^{ti, ■■■tn) be a given 
generalized moment of the process {X\{t)}t- Let My^ {ti, ■■■tn), the corresponding generalized moment of the process {Y\{t)}t^ 
Let us consider the Taylor series (for around 0) of these moments. In the case the zero orders as well as the first following 
non trivial orders of these Taylor series are identical for any finite generalized moment, we will write 



Xx{t) ~ Yx{t). 



(36) 



B. The renormalized magnitude fl{t) 

Let us define the process il{t) that is at the heart of our approximation theory: let uji,T{t) be the Gaussian process defined 
in Eqs (|5j and (|6j or Eq. (fT4l i. We define the Gaussian process ili{t) as 



{uJi,Tis) - E[a;;] )ds, 



(37) 



The renormalized magnitude process ^{t) is defined as the weak limit of ^i{t) : 
Theorem 1: The process {r2;(t)}f admits a weak limit when I goes to 0: 

Vl{t) = \mi^Vti{t) (38) 

Proof: The proof of this theorem is a direct consequence of Lemma [T] (Convergence of the finite dimensional laws) and 
Lemma |2] (tightness) of Appendix |A] ■ 
In the sequel, if / = \t — t, <] is some interval, VL{I) will stand for the variation of the renormalized magnitude over this 
interval: 

n{i) = 5rn{t) = n[t) - n{t - r). (39) 

The exact expression of the co variance of the renormalized magnitude can be simply computed using Lemma [T] of Appendix 



Proposition 10: Let r > and h > For all t, one has: 
. if h + T <T, 



Cov 



In 



I h 



where the function f{u) reads 

/(") 

. if h>T + T, 



Infl 



ln(l 



-21n(2), 



if u > 2, 
if u = 1, 
if u = 0, 



(40) 



(41) 



(42) 



Cov[dr^l{t),SM{t + h)] =0. 

Let us note that in the case t <^ h < T + t, Eqs (|40] i and PTt simplify a lot. Indeed, in this case, the function f{u) becomes 
f{u) = -3/2 + 0{l/u), one thus gets 

Corollary 1: Let t h < T + t, then for all t, one has: 



Cow 



= ln( |) +OiT/h), 



(43) 



T T ^ 

We are now ready to formulate the main approximation results one can obtain in the limit of small intermittency A^ — > 0. 



C. Convergence in law towards the renormalized magnitude 

One can prove an asymptotic theorem concerning the logarithm of the measure of an interval. More precisely, one has the 
following result: 

Theorem 2: Let /i, ...,/„, be n arbitrary intervals. When A^ goes to zero we have the following convergence in law: 
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Proof: From Proposition [12] of Appendix |B] and Proposition [13] of Appendix |C] it results that, for all n, 



lim E 



rr 1 1 (^^^^^ 



= E 



(45) 



A simple multidimensional generalization of the Theorem 4.5.5 in [31] allows one to deduce the convergence in law from the 
convergence of the generalized moments [32]. ■ 
The following corollary on the successive incremements of the measure is a direct consequence of the previous theorem: 
Corollary 2: If r > 0, then 



5rM{t) 



(46) 



D. Approximation of the moments of the logarithm of the measure 

The following result will be particularly useful for the estimation of log-normal MRM as discussed in section |IV] below. 

Theorem 3 (Convergence of the magnitude generalized moments): At scale r > the process {2XST^{t)/T}t reproduces 
the Taylor series (in A^), up to the first non trivial order, of any finite generalized moment of the logarithm of the log-normal 
MRM increments (see Eq. ( l36l l for precision on the following notation): 



In 



SrM{t) \ 



2A- 



(47) 



Proof: This result is a direct consequence of Proposition [12] of Appendix |B] and Proposition [T3] of Appendix |C] ■ 
This theorem allows one to obtain approximations of the mean and of the covariance function of logarithm of the MRM 
increments: 

Theorem 4 (Magnitude mean and covariance approximations): For all r > and h > and t, one has 



Cov 



In 



SrM{t) 



In 



SrMit + h) 



(48) 



where the covariance of the increments of the renormalized magnitude is provided by Lemma [T] in Appendix [A] Moreover, as 
far as h + r <T, then the term o(A^) no longer depends on the integral scale T and depends on t only through the ratio h/r 
and goes to when r ^ (with h fixed). Moreover 



E 



In 



SrMit) 



-2\^ In 



o(A^ 



(49) 



where the term o(A^) depends neither on T nor on t 

Proof: The relationship (l48l l directly results from previous theorem. We simply have to show that for h + t < T, o(A^) 
depends only on h/r and goes to when r ^ 0. 

By using the invariance properties ( ITtI i and ( ITSl l of the Proposition [2] we get the equality in law: 



where W^/x is a log-normal random variable that satisfies 

¥sir[\n{Wh/T)] = 4A2 hi 



(50) 



(51) 



From (iSOl l. one can easily prove that the difference Cov[\n{\6rMT{t)\), \n{\SrMT{t + h)\)\ — Yar[ln{Wh/T)] depends only on 
A^ and h/r. The fact that it goes to when r goes to comes from a straightforward argument using the cone representation. 
Moreover, thanks to Lemma [T] in Appendix [A] one obtains, if h + t <T, 



AyCov 



SMit) dM{t + h) 



du 



t+h 



= 4A^ / — / —In 



dv , 



t-T t+h-T 



\u — v\ 



-Var[ln(W^/T)]. 



(52) 



By choosing the new variables u' = u/t and v' — v/t, we can show that the above integral depends only on h/r and goes to 
when r — > (with h fixed). By inserting this expression in Eq. ( i48] l, we thus conclude that the terms o(A^) in this equation 
depends only on h/r. 

A similar computation allows us to deduce Eq. (i49] l. ■ 
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E. Approximation of the moments of the measure 

As far as the generalized moments of the measure itself {STMit)) are concerned. Theorem [3] suggests that they could be 
well reproduced to the first non trivial order by the moments of the process re ^ . It is easy to see that this cannot be true. 
Indeed, the mean of the two previous processes are different simply because the expectation of the exponential of a random 
variable is not the exponential of its expectation. It is therefore necessary to slightly modify the process ^^^^^^^ by changing 
its mean value. 

Theorem 5: Let r > 0. The process |T-e2^'5^f^(*)/'^'2A Var[AVri(t)/T] |^ reproduces the Taylor series (in A^), up to the first 
non trivial order, of any finite generalized moment of the log-normal MRM increments (see Eq. ( |36] | for precision on the 
following notation): 

5rM{t) ~ re'^^-'^ Var[^J (53^ 



Proof: The n-points moment of the rh.s. process can be written as: 



m(ii, . . . ,t„) = T e L ^ JE 



^ ' -'i — 1 



(54) 



If one considers the Taylor series expansion of this expression and replaces the variance of X^iLi ^r^^i) by its expression 
(provided by Lemma [T] in Appendix lAli. one gets: 

m(ii, . . . , t„) = r" + 4t" ^ dm ■ ■ ■ J du,, ^ p{uj - Uk) + o{X^), (55) 

where p is defined by (|99] l. Using Lemma |4] in Appendix |B] it follows 

m{tu . . . , t„) = E [5rM{ti), SrM{t„)] + oiX"), (56) 
which leads to the expected result. ■ 



F. Approximation of the MRW process 

The log-normal MRW process being defined by a Brownian motion subordinated with a log-normal MRM measure, it is 
obvious that the generalized moments of its increments and their logarithm are related to those of the MRM measure. In 
previous sections we have obtained an approximation of these MRM generalized moments. The Theorems [3] IH and |5] naturally 
extend to MRW increments. The following theorems are direct consequences from these theorems. 

Theorem 6: Let r > and {e[n]}„ a gaussian white noise of variance a^. 
The discrete time process |r^/^e[n]e*^^^"'^^/'^~^ Yarls^Q/T]~^ ^ reproduces the Taylor series (in A'^), up to the first non trivial 
order, of any finite generalized moment of the increments of a MRW process X{t) (see Eq. (|36] l for precision on the following 
notation): 

drX[nT) ~ T ' e[n\e ^ L ^ J. (57) 

Moreover the first non trivial order is of order A^. 

Theorem 7: Let r > and {e[n]}„ a gaussian white noise of variance cr^. 
The discrete time process { ln(r^/^) + ln(|e[n]|) + XSr^inr) / t} ^ reproduces the Taylor series (in A^), up to the first non 
trivial order, of any finite generalized moment of the absolute increments of a MRW process X{t) (see Eq. ( l36b for precision 
on the following notation): 

ln\SrX{nT)\ ~ iln(T)+ln(|e[n]|) + A^^^^^. (58) 

2 T 

As in previous theorem, the first non trivial order is of order A^. 
Theorem 8: For all t > et /i > 0, one has, for all t 

Rr {h) = Cov[\n{\dr X{t)\)M\3rX{t + h)\)] = —S{h) + X'^Cov 

8 

where the covariance of the increments of the renormalized magnitude is provided by Lemma [T] in Appendix |A] Moreover, as 
far as h + r <T, then the term o(A^) no longer depends on the integral scale T and depends on t only through the ratio h/r 
and goes to when r — > (with h fixed). Moreover, one has 

E[ln(|5.X(nr)|)] =-l±|^-A2ln(^^)+o(A2), (60) 

where the term o(A^) depends neither on T nor on t and 7 is the Euler constant. 



(59) 
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In the case t <^ h < T + t, Eq. (|43T l gives an approximation of the auto covariance of the renormalized magnitude. It can be 
used to get an approximation of the auto covariance Rr{h), i.e.. 
Corollary 3: Let t h < T + t, then for all t, one has: 

Rr{h) = Cov[\n{\SrX{t)\)M{\SrX{t + h)\)] = A^ln (^^^ + X^O{T/h) + o{X^). (61) 

where the term o(A^) no longer depends on the integral scale T and depends on r only through the ratio h/r and goes to 
when T ^ (h fixed). 

IV. Parameter estimation 

We have seen that a log-normal cascade model is mainly defined by 2 parameters (apart from the variance parameter cr^ 
which is a simple multiplicative factor for the MRW): the integral scale T and the intermittency coefficient A^. Among the 
huge literature devoted to multifractal models and multifractal analysis, there are only very few papers that focus on issues 
related to parameter estimation or related statistical questions (see however [23]-[27]). 

A simple method to estimate A^ would consist in performing a regression of the empirical ({q) function estimated from 
the scaling behavior of the empirical moments. However this method is far from being robust, the variance of this estimator 
converges very slowly (of the type N~^^" with a > 0, see [23]). This method is however sufficient to establish the pertinence 
of the approximation A^ <C 1 in many empirical situations like the analysis of turbulence or financial time series [11], [30]. 
The starting point of our approach of parameter estimation is therefore to assume that we are in the small intermittency regime 
A^ <C 1 and that the results of section |lll] can be used. 

Let = ^ be the total number of samples available, where L is the observation scale and t the sampling period. 
Consequently, the observed samples corresponds to the values 

{A:T(nT)}„g[o,Ar[, where N = ^. (62) 

The estimation problem must be studied in the asymptotic regime A^ +oo. However, this limit can be achieved in two 
different ways. The first one, referred to as low-frequency regime, corresponds to the case where t is fixed and L +00. 
In the second one, referred to as high frequency regime, L is fixed but the and r ^ 0. From a numerical point of view, 
L ^ T, corresponds to the low-frequency regime whereas t ^ T corresponds to the high-frequency regime. In both cases 
N = L/t ^ +00. In the particular case where one has both L ^ T :s> t, the effective asymptotic can be considered to be 
the high (resp. low) frequency regime if ^ <c ^ (resp. ^ ^ ^). For discussions on mixed regime for which L +00 and 
r ^ at the same time, we refer the reader to [33]-[36]. 



A. GMM in the low frequency regime, L — > +00 

The first application of GMM to estimate multifractal models can be found in econometric literature. More precisely, Calvet 
and Fisher [27], [37] used this method to estimate the parameters of a simple cascade model where the random weights follow 
a binomial law. Their work has been further developed by Lux [25], [26]. 

It is easy to see that the three parameters A^, T and a^, are directly related to some moments associated with MRW 
increments or their logarithm. It is therefore natural to use a GMM to estimate these parameters. GMM was initially proposed 
by Hansen [38] and can be described as follows: 

Let us consider the process {Zt [kWk of the logarithms of absolute increments of some MRW process at size r : 

Z^J>\k]=\n\6rX[k]\. 



This process is characterized hy p — Z parameters : 



{\n{a),\\\nT] 



(63) 



(64) 



ifj \ (0 ) 

Given some observation {Zr ° [k]}k, let us denote ° [k], 9) the moment function of dimension r > p, which satisfies the 
following moment condition: 



E 



/(Zf")[fc] 



0, if and only if 6* = 6*0. 



(65) 



In our case, it is natural to choose the variance of the process order to estimate and the empirical covariance 



of zf°'^ at various time lags in order to estimate A^ and T. This leads us to consider 

/ exp(2Z|^'')[fc] \ 

{Z^'°\k]-^ie){Z^'"\k-h^]-pe) 



{zi'°\k]-^,e){\z'f''\k- 



( 



Me) / 



Ce[hi 
V Ce[hK] I 



(66) 
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where 

^le^^[Zr[k]] (67) 

and 

Ce [h] = Cov [Zf ) [fc] , ) [k-h]], (68) 

and hi, ... , hx are iiT different positive lags. Let us note that, a first order (in A^) analytical expression of Ce[h] = Rrihr) 
is provided by Eq. (|59] l. 

The moment condition ( |65] l can be approximated by using the empirical mean: 



1 ^ 

The GMM estimator is then simply defined by 



N 



e ^ argming(gjj Wn 5jv ) , (70) 

where Wn is a sequence of weighting matrices that converges, when N +oo towards some matrix positive definite Wqo. 
Hansen has established the following result: 

Theorem 9 (Hansen [38]): If the following hypotheses hold: 

• The process {z!f"''[k]}k is ergodic, 

• The series {,f{Zr [k],9)}k satisfies a central limit theorem, i.e., 

-^^/(zf")[fc],0)^AA(O,y,), (71) 



where the matrix Vg is defined as: 

M — ^+oo ^ — ^ L 



M 



The (r x p) matrix Dgpf = has full rank (p) and converges towards 

df{zi'°\k],e) 



Df^E 



de 



(72) 



(73) 



then, the GMM estimator 6 is consistent and verifies 

VN{e-e) ^U{o,E), (74) 

where 

S = {DfWooDfy^DfWooVg„W^Df{DfWooDfy\ (75) 

Moreover, the estimator 9 is optimal if PFoo — Vg^^ as defined in Eq. dT^j i. In that case the asymptotic covariance of the 
estimator is 

^opt^{DfV,yDf)-\ (76) 

In practice [39], it is obviously difficult to use the optimal weighting matrix Woo — Vg^^ since one does not know the vector 
Oq. One usually proceeds using the following iterative algorithm: 

1) Choose some arbitrary initial weighting matrix Wn, such as IAn, 

2) Compute the GMM estimator dTOl l using this matrix Wn, 

3) Replace the weighting matrix by Wn — V^^, where 9 is the obtained estimated parameter vector. 



4) Repeats step 2 and 3 until successive estimates are sufficiently close one to each other. 
Confidence intervals for 9 can be obtained using Eq. (|74] i. 

One can easily show that the hypothesis of Theorem |9] hold in the case the moment function is defined by ( |66] |. However, 
there is one major problem for implementing the corresponding GMM method : we do not have any analytical expressions 
neither of ^ig (Eq. dSTJi) nor Cg[h] (Eq. (|68ll), nor Vg (Eq. (|72]i) . Actually Eqs ^ and ^ of Theorem g] give analytical 
approximations (up to a o(A^) term) to both ^g and Cg{h\. Let us note that these very same equations also allow to derive an 
analytical expression (up to a o(A^) term) of Vg [32]. It is tempting to use these approximations in the moment function ( |66] l 
and for the weighting matrix and try to use the exact same GMM algorithm. This is exactly the framework of the so-called 
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GMM estimation in a misspecified model (see [39]). The model is considered as misspecified since the moment function no 
longer satisfies ( [65l l. Instead, one has 

E [/*(Z(^°)[fc],6l)j = r{6), for all k, and ||r(6l)|| > for all 9, (77) 

where /* corresponds to the moment function ( |66] l in which we have substituted and Ce[/i] by their approximations. Now, 
if we suppose that there exists 6* such that 

E[r{Zeo,O*fWooK[r{Ze„,9*]<E[f*{Ze„,9fWooE[r{Zg„,0], yO ^ 9\ (78) 

using the results of [39], one can show that the so-obtained approximated GMM gives a consistent asymptotically gaussian 
estimator of 9*. Moreover, we expect 

9*^9 + o{\^). (79) 

In order, to illustrate this estimation method, we have run a Monte-Carlo test on MRW realizations. The results are shown 
in Table J] Each MRW was simulated on a discrete time grid of period r = 1 and of various size L using the algorithm 
described in Section III-Bn . The number K of different lags hk used in the moment function (l66T l is X = 43 and the lags hk 



are approximately logarithmically distributed between 1 and 150. 

For each set of parameters, we simulated 10000 realizations of such MRW and ran the misspecified GMM algorithm on 
each of these realizations. For each parameter (In(cr), A^, ln(r)) we computed associated GMM estimators (In(CT), A^, ln(T)). 
We then computed for each of them the so-obtained bias (the Bias column), the mean square error (MSB column) and we ran 
the Kolmogorov-Smirnov [40] test for testing the normality of the estimations. The corresponding p-values for this test are 
indicated in the KS column. Thus, for instance, a 5% level test is satisfied if the p-value is greater than 0.05. 

Clearly the effective value of a will slightly affect the performance of the GMM algorithm, since it just corresponds to 
a multiplicative factor Thus, in all the numerical experiments we arbitrarily set it to cr = 1 (i.e., In(cr) = 0). The global 
scale invariance property ( fTTj i shows that changing the value of the parameter T amounts to changing the number of samples 
L/t = L (since t = 1) of the realizations. Consequently, the realizations only depend on the ratio L/T, i.e., the number of 
integral scales in a realization. We arbitrarily choose to fix T and have L varying. In this section, we only adress the low 
frequency regime T <t: L. We choose T = 200 (i.e., ln(T) ~ 5.298...) and L among {2048,4096,8192,16384,65536}, i.e., 
the number of integral scales L/T varies from 10 to more than 320. We are thus left with only A^ as a "free" parameter. We 
used two different values for A^ : 0.02 and 0.04. Thus, two different sets of parameters were used : the first set (top half of 
Table HI) corresponds to cr = 1, A^ = 0.02 and T = 200 and the second set (bottom half of Table IJi corresponds to cr = 1, 
A^ = 0.04 and T — 200. Let us note that adding some more lags hk (i.e., increasing K) does not significantly improve the 
results (see the line corresponding to size L — 16384* in Table |T] which corresponds to X = 69 instead of K = 43). 

For all parameters. Table U shows clearly that the MSB is entirely dominated by the variance (the bias contribution is 
negligeable). Let us discuss the results obtained for the estimation of each parameter one after the other. 

• logcr : Clearly, the theoretical GMM asymptotics for the parameter \n{a) is reached as soon as iV = 2048. This is 
indicated both by the fact that the MSB decreases as 1/ \/Z and that the Kolmogorov-Smirnov normality test has a very 
high p-value as soon as > 2048 (for N = 2048, the p-value is almost 40% when A^ = 0.02 and almost 25% when 
A^ = 0.04). 

• A^ : For A^ the situation is somewhat different. Though the estimation is surprisingly good even for the shortest realizations 
in the sense that the MSB is very small, the normal asymptotics cannot be considered to be reached when N < 16384, 
i.e., when the number of integral scales L/T is smaller than 80. 

• InT : Here the GMM asymptotics for parameter ln(T) is the slowest. Though the MSB is small for N > 16384, the 
normal asymptotic can hardly be considered to be reached even for N — 65536. 

Let us note that, in any case, the o(A^) term in (|79]l due to the mispecification of the model hardly shows up in these results. 
Indeed, we expect a bias of the order of A"* (i.e., the "next" order after A^), thus of the order of 4e — 04 for the top half of 
the Table (A^ = 0.02) and 1.610"^ for the bottom half (A^ = 0.04). Bxcept for the case A^ = 0.04 and for the estimation of 
the parameter A^ (for which the bias saturates around 210^^), there does not seem to be any trace of this term : L is not large 
enough. Bven when the bias saturates around 210—5 (for an MSB of 0.0015), in order this saturation value to dominate the 
MSB, L should be of the order of 10^^! Thus, though the model is theoretically mispecified, from a practical point of view, 
it can be considered as well specified. 

As a conclusion to this section, we can state that the GMM estimations are reliable in the low frequency regime, however, 
except for cr the normal asymptotic confidence intervals should not be used. Monte Carlo simulations should be performed to 
get confidence intervals. 



'As explained in this section, we chose I = 128. Let us note that increasing / does not significantly change the numerical results 
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r = 1 


In <j 


= (cr = 


1) 




= 0.02 




InT ~ 


5.298.. 




= 200) 


L 


Bias 


MSB 


KS 


Bias 


MSB 


KS 


Bias 


MSB 




KS 


2048 


-5e-03 


0.070 


0.39 


5e-04 


0.0072 


3e-08 


-0.013 


1.15 




2e-72 


4096 


-2e-03 


0.049 


0.49 


3e-04 


0.0048 


le-03 


-0.026 


0.76 




le-33 


8192 


-6e-04 


0.034 


0.67 


le-04 


0.0032 


2e-03 


-0.015 


0.50 




9e-16 


16384 


-8e-04 


0.024 


0.56 


2e-05 


0.0022 


0.08 


-0.009 


0.34 




6e-8 


16384* 


-9e-04 


0.024 


0.54 


-2e-05 


0.0022 


0.08 


0.005 


0.35 




5e-09 


65536 


-2e-04 


0.012 


0.49 


6e-06 


0.0011 


0.45 


-0.002 


0.17 




0.01 


T = 1 


In a 


= (o- = 


1) 




= 0.04 




InT ~ 


5.298.. 


(T 


= 200) 


L 


Bias 


MSB 


KS 


Bias 


MSB 


KS 


Bias 


MSB 




KS 


2048 


-le-02 


0.110 


0.24 


7e-04 


0.0095 


9e-05 


-0.130 


0.88 




5e-32 


4096 


-5e-03 


0.072 


0.34 


4e-04 


0.0064 


0.01 


-0.054 


0.59 




4e-18 


8192 


-3e-03 


0.050 


0.48 


2e-05 


0.0044 


0.06 


-0.027 


0.41 




3e-6 


16384 


-2e-03 


0.035 


0.52 


-2e-05 


0.0031 


0.08 


-0.014 


0.28 




2e-5 


65536 


-4e-04 


0.018 


0.42 


-4e-05 


0.0015 


0.40 


-0.002 


0.14 




0.05 



TABLE I 

GMM ESTIMATION OF MRW PARAMETERS. BACH LINE CORRESPONDS TO GMM ESTIMATION AS EXPLAINED IN SECTIOn HV-AI oN 
10000 REALIZATIONS OF DISCRETE-TIME MRW WITH T = 1 AND OF SIZE L. L VARIES FROM 2048 TO 65536. THE LAGS hk USED FOR 
GMM ESTIMATION IN BQ. ARE CHOSEN SUCH THAT if = 43 AND APPROXIMATELY LOGARITHMICALLY DISTRIBUTED BETWEEN 1 
AND 150 EXCEPT FOR THE LINE CORRESPONDING TO iV = 16384* FOR WHICH MORE LAGS WERE TAKEN (K = 69). THE MRW WERE 
MONTE-CARLO GENERATED USING THE ALGORITHM DESCRIBED IN SECTIOn HI-BI TWO SETS OF PARAMETERS WERE USED : a = 1, 
A^ = 0.02, T = 200 FOR THE TOP HALF AND CT = 1, A^ = 0.04, T = 200 FOR THE BOTTOM HALF. 



B. GMM estimation in the high frequency regime r ^ - Estimation of the nature of the asymptotic regime 

In many practical situations (e.g., when dealing with financial time series) the data are sampled at some high frequency 
T <C r over a time period L that is smaller than (or of the order of) the integral scale T. As akeady explained, in that case, the 
right asymptotic regime to consider is the high frequency regime r — > 0. Let us try to understand how behaves the previously 
described GMM procedure in that context. As we have already pointed out, the moment function ( l66b involved in the GMM 
has two types of components : the first component corresponds to the empirical variance of the increments of the MRW process 
itself and basically allows one to estimate while all the other components correspond to the empirical covariance of the 
logarithm of the same increments and allow one to estimate T and A'^. 

According to Eq. ( |26] |. the log-normal MRW process {XT{t)}t<L satisfies the following equality in law: 

{^T(t)},<^ = {M/L/T^LW},<^, (80) 

where /t is a log-normal random variable which law is given in Proposition |6] and which is independent of the MRW 
process {XL{t)}t<L which integral scale is equal to the observation scale L. Given some sample of length L of the MRW 
process, the variable /t takes a fixed value and can be considered as a simple multiplicative factor that simply changes the 
variance of the process. Consequently, the estimation problem of both tr^ and T is ill-posed. It is fundamentally impossible 
to estimate independently the integral scale T and the variance cr^ of the process since they both appear as a multiplicative 
factor of the whole process. T is no longer a "true" parameter of the model it can be arbitrarily fixed. Moreover, even if we 
knew the true value of T, there is no chance for the GMM variance estimator In a to converge to in the asymptotic limit 
T since it is easy to show that the empirical variance of the increments, itself, does converge in the limit r towards 
a random variable (see e.g. [14]). Hence, in this regime, the first hypothesis upon which GMM relies, namely the ergodicity 
of {ZT-[n]}n is not satisfied. 

Since the value of T is the key to decide in which asymptotic regime one is (L ^ T for the low frequency regime and 
T ^ T for the high frequency regime), it is of fundamental interest to understand how the GMM estimation of T behaves in 
the high frequency regime t ^ 0. Actually, the GMM estimation of In T (and of A'^ ) basically consists in fitting the empirical 
covariance of the logarithm of the increments of the MRW process. Thus, it is natural to study the mean of this empirical 
covariance, in the high frequency regime r — > 0. 

Proposition 11: Let us consider the fixed observation scale L < T where r is the sampling scale and L/t is the number 
of samples of the MRW process. We introduce the empirical covariance Rr[n] and the empirical mean jlr,M '■ 

L/r-n-l M 

RAn] = j (In |<5.X(fcr] I -Ar.L/r) (ln|<5rX((fc + n)T] I- /V,L/r), Ar.Af = ^ I'^-^^^^H- (^1) 

fe=l fe=l 



Let h > 0, then the expectation of the empirical covariance Rr[h/T], in the high frequency asymptotic r ^ 0, is 



lim E 



(82) 
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r = 1 


In a 


= (cr = 


1) 




= 0.02 




IriT ~ 


9.704.. 


(T = 16384) 


L 


Bias 


MSE 


KS 


Bias 


MSE 


KS 


Bias 


MSE 


KS 


8192 


-0.04 


0.27 


.55 


le-04 


0.003 


0.26 


-1.98 


2.44 






TABLE II 

GMM ESTIMATION OF MRW P ARAMETERS IN THE CASE T/r > 1 AND L <T . EACH LINE CORRESPONDS TO GMM ESTIMATION AS 
EXPLAINED IN SECTIOn HV-AI ON 10000 REALIZATIONS OF DISCRETE-TIME MRW WITH T = 1 AND OF SIZE L = 8192. THE LAGS hk 
USED FOR GMM ESTIMATION IN EQ. ||66} ARE CHOSEN SUCH THAT K = 43 AND APPROXIMATELY LOGARITHMICALLY DIS TRIBU TED 
BETWEEN 1 AND 150. THE MRW WERE MONTE-CARLO GENERATED USING THE ALGORITHM DESCRIBED IN SECTIOn HI-EI 



Proof: One can easily prove the following general relation that gives the expectation of the empirical correlation function 
of a given process : 

E [Rr[h/T]\ - Rr{h) = -Var[Ar,L/r] - ^ + Var[Ar.L/r] - 2Cov[Tlr,h/r,'fir.L/r\) (83) 

According to Eq. ( |6TI ) of Corollary [3] under the condition L < T, the covariance function Rr{h) for ft, > 0, in the high 
frequency asymptotic r ^ 0, is given by 

(84) 



hniRrih) = A^ln ( - 

Moreover, from the definition of the empirical mean, one can write the following equations 

L L 



\im War [il^^L/^] ^ j ^ J ^ 



T 



and 



hmLov[iir,h/T, IJ't,l/t\ =A / / 1^ 



T 



\u — v\ 



n 
8L 



8L' 



(85) 



= In 



/Te3/2 



\ L J " 2L 

Inserting these last three equations in Eq. dSST l leads to the expected result 



In 



(L_-hf 
2Lh 



In 1 - 



TT 

8L' 



(86) 



Let US remark that liiHr^a E 



Rr[h/l 



does not depend on the integral scale T. This is not surprising considering the 
remark we just made at the beginning of this section. Now, the leading term of Eq. ( [82] i. when L ^ h is 



lim E 



Rr[h/T] 



y In 



h 



(87) 



ln(Le"^/2) ^ \n{L) - 3/2, independently of the "true" 



Identifying this equation with Eq. ( [6l1 l. shows that we expect 

(i) the estimator A^ to be unbiased and 

(ii) the mean of the estimator In T to be of the order of E In T 
integral scale T value. 

These results are ilUustrated in Table |ll] which displays the output of the GMM estimators described in the previous section. 
The estimations where computed using a realization of size L = 8192 (r = 1) of a MRW process with parameters a = 1, 
A^ — 0.02 and T = 16384. The choice T > L t clearly corresponds to the high frequency regime. This table uses the 
same format as Table I] : for each parameter (In cr, A^ and In T), the bias, the mean square error (MSE) and the p-value of the 
Kolmogorov Smirnov normality test (KS) are computed using a Monte-Carlo method with 10000 realizations. 
Let us discuss the results obtained for the estimation of each parameter one after the other. 

• Incr : As expected, the estimator of \n<7 has both a very high bias and mean square error (it does not converge!). 



In T : the estimator of In T is biased, its mean is found to be E 



InT 



9.704 - 1.98 = 7.724 which is very close to 



the expected order \n{L) — 3/2 ~ 7.51. This can be used as a way to detect the fact that we are in the high frequency 
regime. 

A^ : On the contrary, the estimation of A^ is excellent, the bias and the MSE are of the same order as the ones obtained 
in Table U In the following section we prove that an estimator of A^ based on the regression of the empirical covariance 
function of the logarithm of the increments of a MRW is an unbiased and consistent estimator. 
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C. A convergent estimator of in the high frequency regime r ^ 
We consider the sequence of the absolute increments of an MRW 

Zr[k]=\n\5rX[k]\. (88) 

According to Theorem [8] and Proposition [TOl for any integer n > such that nr < T, one has 



R^{nT) ='Cov[Zr[k],Zr[k + n]] ^A^^ln^^^— ^ - /W^ +o{X'^), 



(89) 



where o(A^) depends only on and n and where the function f{n) is defined by Eq. (l4Tl l. It follows that, if n and n' are two 
different integers such that < rtr < T and < n'r < T, the difference Rt^tit) — Rr{n'T) does not depend neither on the 
integral scale T nor on the sampling scale t. This naturally leads us to the estimation of A^ relying upon a simple regression: 

A?= "''"'-"-M , (90, 

g[n) - g[n') 

where the empirical covariance Rr is defined by (ISTT i and the function g{n) by 

= /(n) + ln(n). (91) 

We then have the following theorem 

Theorem 10: Let n, 7i' two different integers such that < nr < T and < n'r < T. In the high frequency asymptotic 
regime r ^ 0, the estimator defined by (l90l l is biased with an asymptotic bias of the order of o(A^). Moreover it is consistent 
and its variance decreases as 

JHN) 



Var[A2]=Of^j, (92) 



where N = L/t. 

Proof: The proof for the first assertion is straightforward : the estimator ( |90l ) has a bias of the form 



E 



A2 



\' =0\^]+o{\^). (93) 



The hard part of this theorem is to prove the consistency and the speed of convergence. The rigorous proof is tedious and we just 
give in Appendix |D] the main points of this proof, leaving to the reader some uninteresting and long (though straightforward) 
computations. ■ 



V. Application to financial time-series 

One of the most important problem in finance is the modelling of price fluctuations of a risky asset. Since Mandelbrot 
famous work on the fluctuations of cotton price in early sixties [42], it is well known that speculative price variations are 
poorly described by the standard Geometric Brownian motion (see e.g., [43]-[47]) that does not permit to explain the well 
known intermittent and correlated nature of volatility variations [2], [48], [49]. During the last decade, the availability of huge 
data sets of high frequency time series has permitted intensive statistical studies that lead to uncover a very rich and non 
trivial statistical structure, that is to some degree universal across different assets. These empirical studies have suggested 
that financial data share many statistical properties with turbulent velocity "intermittent" fluctuations and notably display 
multiscaling properties [2], [11], [50], [51]. In that respect, the phenomenology of multifractal models [26], [34], [50], [52] has 
provided new concepts and tools to analyze market fluctuations and in particular the log-normal MRW disussed in this paper 
has been shown to account very well the return fluctuations and volatility correlations over a wide range of time horizons. In 
this section we use the previous GMM method to calibrate the MRW model from daily return time series. We also show that 
this model provide a simple way to forecast Value at Risk with better performances than classical GARCH models. 



A. The financial time-series 

The financial data we have used in this section are daily (r — 1) close prices of some french stocks that are part of the 
CAC40 french index. This index is computed using 40 of the largest french stocks of the euronext market. We only kept those 
with the longest historic. Thus the data consist in the close prices P{t) of 29 stocks between the years 1990 and 2005Q. Thus 
the time-series associated with each stock has L = 3770 samples. We use here the MRW process as a model for the log of 
the price \nP{t). Since the GMM estimation described in section ITV-AI is partly based on the computation of the logarithms 
of the increments of the MRW, in the case the close price does not change from one day to the next, one cannot compute this 
increment. We chose, in that case, to change the value of the second price randomly by one tick up or down (the tick is the 
smallest effective change of the price of a stock). 

*We have adjusted these prices taking into account the dividends and the eventual splits. 
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TABLE III 

The first three columns correspond to the estimations of In a, \^ AND In T GIVEN BY THE GMM. THE NEXT TWO COLUMNS 

SHOW THE CORRESPONDING VALUES OF a AND T. 



The problem of risk estimation or forecasting is essential in quantitative finance. However, several risk measures can be 
used. In this paper, we address two of them, which are widely used : the (historical) volatility and the Value at Risk (VaR). 
In any case, risk forecasting will make extensive use of the aggregation properties of section |III] Before explaining how the 
forecasting is perfomed, one should estimate the MRW parameters. 

B. Parameter estimation using GMM 

In this section, we present the results of the GMM estimation as described in section IIV-AI They are sum up in Table |lll] 
The first three columns show the estimations of In cr, and In T given by the GMM for each stock. And the next two columns 
show the corresponding values of a and T. 

One can see that, in most cases, the estimation of T lead to values greater or of the order of L = 3770. For those, the low 
frequency regime is clearly not reached. The observation scale L is not large enough compared to the integral scale. However, 
one can consider that the high frequency regime is reached T ^ 1. Thus, as shown in section IIV-BI (i) the estimator of a 
does not converge, (ii) the estimation of T is not reliable and depends essentially on L however, (iii) the estimation of A'^ is 
reliable. The only way to get confidence intervals is to use Monte-Carlo. We computed 5% confidence interval using 15000 
realizations of the MRW process with a — \ (we normalized the logarithm of the stock prices), A^ = 0.02 and T — 3770. 
We got A2 e [0.013, 0.027], and f e [200, 250000]. This shows that all the results in Table HI] are compatible (at a 5% level) 
with a single set of parameters ; A^ — 0.02 and T — 3770. 

C. Volatility forecasting 

Volatility is a model dependent notion. For instance, for GARCH models [53], at a given time the conditional volatility 
(to all the observed past) is a deterministic number whereas for stochastic volatility models as well as for the MRW model 
it is a random variable. In order to compare different models on what is generally referred to as "volatility forecasting", one 
needs to define a common problematic. The problematic we consider here is the forecasting of absolute returns : forecasting 
\5sX{t[) + h)\ knowing all the past data {5rX{ty}t<to (with s > t — 1 day). The parameter s will be referred to as the 
prediction scale and h as the horizon. 
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TABLE IV 

Forecasting performance of the three MRW-based methods (MRWLin, MRWSq, MRWLog) depending on the error 
criterion (either : mse or : mae) and on the scale s and horizon h forecasting. each entry corresponds to 
the number of stocks (out of 29) for which the corresponding mrw-based forecasting beat both garch( 1 , 1 ) and 

TGARCH( 1 , 1 )-BASED FORECASTING. 



Three methods will be used for volatility forecasting using MRW model. The first one, referred to as MRWLin simply 
corresponds to solving the linear prediction problem of estimating 6sX{to + h) as a linear combination of {5r^(i)}t<to- For 
this purpose we need analytical expression of the mean and the auto-covariance of the increments process. This is given by 
Eq. ( ISTT i of Theorem |6] along with Eqs (|40] i, ( 1411 1 and ( l42l i of Proposition [10] The second one, referred to as MRWSq, is 
based on the exact same equations. It simply corresponds to solving the linear prediction problem of estimating \6sX{tQ + 
as a linear combination of {\ST-X{t)\'^}t^tg. The last one, referred to as MRWLog simply corresponds to solving the linear 
prediction problem of estimating In \6sX{to + h)\ as a linear combination of {In \SrX{t)\}t<to- For this purpose we need an 
analytical expression of the mean and the auto-covariance of the logarithm of the increments process. This is given by Eqs 
(l48T l and dHJ of Theorem H] along with Eqs (gOli, (HB and (|42ll of Proposition [TOl 

We compare the results of the MRW-based forecasting with econometric models that are standard for volatility forecasting. 
We use the standard GARCH(1,1) model (with normal innovations) and the t-student GARCH(1,1) model (with t-student 
innovations) referred to as the tGARCH(l,l) model [54]. 

For all the MRW-based forecasting, we use the same two parameters — 0.02 and T — L = 3770 (in the previous section, 
we have seen that these values were compatible with the estimations performed on most of the stocks). In order to make the 
GARCH-based forecasting harder to beat, we choose to estimate the GARCH parameters using maximum likelihood estimators 
[55] separately on each stock time-series, using the entire time-series. Thus, we perform in-sample GARCH-based forecasting 
and out-of-sample MRW-based forecasting. The forecasting errors are computed using both an norm (MSE) and a norm 
(MAE). Table |IV] displays the number of stocks (out of the 29) for which the corresponding MRW-based forecasting beat 
both the GARCH(1,1) and the tGARCH(l,l) forecasting. This is done independently for each error (MSE or MAE) and for 
different horizons and scales. We see that for MAE error all MRW-based forecasting clearly outperform both GARCH(1,1) 
forecasting at any horizon and any scale. MRWLin performs even better than the two other MRW-based methods. For MSE 
error, both MRWLin and MRWSq forecasting outperform both GARCH forecasting with a preference toward MRWSq (which 
is not surprising since the corresponding linear prediction, by definition, minimizes the mean square root error). 

D. Value at Risk forecasting 

Given the log return process X{nT) (remember that r = 1 day in our case) and the present time — uot, the conditionnal 
Value at Risk VaRp at confidence level p, at scale s > r and at horizon h> is defined by he relation 

V[6sX{to + h) > -YaRp{to)\X{nT),n<no] ^ p. (94) 

It thus corresponds to the maximum loss on a given scale and horizon and at a given confidence level. The highest it is the 
riskiest the asset is. 

The estimation of VaRp in the case X is an MRW process is based on Eq. ( fSSl l of Theorem |7] This equation means 
that the process In |(5T-X(nT)| can be seen (in the first order in as the sum of the logarithm of a white gaussian noise 
e[n] and of the renormalized magnitude which is gaussian and independant of e[n]. Thus, at time to, the conditionnal law of 
\SrX{nT)\ and consequently the associated value at risk can be estimated using an estimation of the conditionnal law (i.e., the 
conditionnal mean and variance) of the renormalized magnitude. Conditionnal mean and variance estimations can be performed 
solving, as in the previous section, the linear prediction problem of estimating \n\SsX{to + h)\ as a linear combination of 
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nGARCH tGARCH MRW 



p s = 1 day, h = \ day 

03% TTTTS 27 4 24, 23 

1% 21, 21 0, 2 21, 20 

5% 16, 13 0, 22, 13 

10% 3, 5 0, 24, 20 

20% 2. 2 0, 25, 24 

p s = 1 day, /i = 6 days 

03% TTTO 27 7 26, 20 

1% 22, 15 1, 1 22, 16 

5% 11, 7 0, 20, 10 

10% 2, 3 0, 21, 12 

20% 0, 1 0, 25, 18 

p s = 1 day, h = \ \ days 

03% 97U) 579 26, 19 

1% 22, 15 1, 1 24, 14 

5% 11, 6 0, 20, 8 

10% 2, 3 0, 22, 9 

20% 0, 0, 26, 19 

p s = 5 days, h = 5 days 

03% 227^1 22722 23, 21 

1% 23, 24 14, 19 26, 24 

5% 23, 24 4, 4 21, 24 

10% 19, 18 4, 4 23, 24 

20% 14, 17 2, 7 21, 22 

p s = 5 days, h = 10 days 

03% 25720 237T8 26, 18 

1% 26, 25 14, 17 24, 23 

5% 22, 19 3, 4 23, 24 

10% 15, 18 1,4 24, 24 

20% 10, 14 0, 4 21, 22 



TABLE V 

Number of stocks (out of 29) that are accepted by the Kupiec test (left number) and the Christoffersen test 

(RIGHT number) FOR A CONFIDENCE LEVEL OF 95%. BOLD FACE NUMBERS CORRESPOND TO THE CASE WHERE THE MRW-BASED 

estimation passes the test more times than both GARCH and tGARCH based estimation. 



{In \ drX{t)\}t<to- The prediction obtained corresponds to the conditionnal mean estimation and the variance of the prediction 
corresponds to the conditionnal variance estimation. 

We use two different tests for testing the estimated conditonnal value at risk. They are both based on the series 

fl, if S^X (nor + h)<-Y.R„ 
^ [0, if SrX{noT + h)> -YaRp, 

One can easily show that the process {/p[n]} is a Bernoulli process with parameter p. Thus, the first test, generally called the 
Kupiec test [56], is based on the equation 

F[lp[n] = 1] = p. (96) 

Thus, this test does not take into account the dynamic of the {/p[n]}. The Christoffersen test [57] does. It is based on the fact 
that 

P[lp[n] = l\lp[n - 1] = 0] = F[lp[n] = l\lp[n - 1] = l] = p. (97) 

The results of these tests are illustrated in Table |V] for both MRW-based estimation (using the estimation of the renormalized 
magnitude conditionnal law as described above) and GARCH-based estimation []. As in the previous section, the MRW 
parameters and T are fixed once for all (A^ = 0.02, T = 3770) whereas, the GARCH (and tGARCH) parameters are 
estimated using a maximum likelihood estimation independantly for each stock on the entire time-series. Each entry of Table 
rvl is composed of two integers separated by a comma. The left number is the number of stocks (out of 29) which passed the 
Kupiec test, the right number to the number of stocks that passed the Christoffersen test (both tests are performed using a 
confidence level of 95%). Bold face numbers correspond to the case where the MRW-based estimation passes the test more 
times than both GARCH and tGARCH based estimation. 

The superiority of the MRW-based estimation over the GARCH or tGARCH-based estimation appears very clearly at all 
scales and horizons and at all level p. 
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VI. Conclusions and prospects 

In this paper, we have reviewed the main properties of log-normal continuous cascades and developed an approximation 
framework in the limit of small intermittency. We have shown that, within this approximation, the law of the the process 
increments, at each time scale, can be expressed under simple analytic forms so that the process aggregation properties are 
easy to control and deal with. As far as parameter estimation problems are concerned, we have pointed out that one has to 
distinguish "low" and "high" frequency asymptotic regimes according to which the properties of the samples are somehow 
different. In the "low" frequency regime, one considers samples of abritrary increasing sizes at fixed sampling frequency. In 
that case, we have shown that the model parameters can be estimated with a GMM method mainly relying upon empirical 
covariance function of log-increments of the processes. The high frequency regime corresponds to a situation when the process 
is observed over a finite lenght and sampled at increasing rate. This case is not equivalent to the former one and only the 
intermittency p aramter can be faithfully estimated. Indeed, because of the self-similarity of the process, the integral scale 
is no longer a parameter and can arbitrarily chosen to be the overall sample lenght while the estimator of converges towards 
a random value. 

Our approach has been applied to financial time series for which it is well known that log-normal MRW provides a 
particularily parcimonious model that allows one to reproduce most of well documented stylized facts. At 5% confidence level, 
our estimates show that all the analyzed stock return series are multifractal but with a small intermittency coefficient }? ~ 0.02. 
The low intermittency <C 1 approximation is thus likely to be sound. Moreover, our estimates of T values suggest that 
the integral scale magnitude order is greater than one or several years. The ability of log-normal MRW to model volatility 
dynamics has been illustrated by its perfomances in conditional Value at Risk forecasting. From a practical point of view, the 
main interest of MRW-like models is that they capture the "heteroskedastic" nature of return fluctuations, by preserving, in 
some sense, the nice stability properties accross time scales of the Brownian motion. 

Time series analysis involving multifractal processes is still in its infancy. In forthcoming studies, we will extend the approach 
presented here to continuous cascades with arbitrary log-infinitely divisible laws. In particular the prospect to define a semi- 
parametric test for the multifractal nature of a time series is very appealing. The two asymptotic regimes discussed in this 
paper, also challenge many interesting issues: in some recent work, we have shown that they can be described within the 
general framework of "mixed asymptotics". In this regime, the overall sample lenght L increases while the sampling scale 
T — > [33]-[36]. A priori many statistics related to multifractal processes and notably the extreme value statisitics explicitely 
depend which "mixed asymptotics" we are, i.e., on the relative velocity according to which L ^ oo and r ^ [33]. 

Appendix A 

Existence of the renormalized magnitude Q{t) 

Lemma 1 ( Convergence of the finite dimensional laws of fi; [t) ): Let <i , . . . , <„, n be real numbers, then the Gaussian vector 
(ili{ti), . . . , ri;(i„)) converges, when I goes to zero, toward the centered Gaussian vector (il{ti), . . . , il(i„)), which does not 
depend on A and which covariance matrix reads: 

(S),fe = Cov[nitj),n{tk)] JduJ dvp{u-v), (98) 



where the function p{t) is defined by 

(xHn(^), if|i|<T, 
pit)^{ Vl*iy' II (99) 

lo, if T<\t\. 

Proof: The function pi{t) being defined (cf. section as the correlation function of the process uji^rit), the vector 
(p.i{ti), . . . , ili{tn)) is a centered Gaussian vector which covariance matrix is 

tj tfc 

{^i),k^Cov[ni{tj),ni{tk)] = ^ J '^''J dvpi{\u-v\), (100) 



This matrix converges toward the matrix S (which all coefficients are finite) when ^ — > 0. It thus suffices to show that S is 
semi-defined positive. It can be shown [32] that the function p{t) can be written as 



p{t)^ I p{t-s)p{s)ds. (101) 

If one defines the vector V{s) as 

V{s) = ( / - s)dui, I p{un - s)dun^ (102) 



tl t„ 
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then the matrix S can be written as 



oo 

S = y V{s)V^{s)ds, 



(103) 



thanks to the identity p{s) = p(—s). Consequently, E semi-defined positive. ■ 
Because the variables Hi{t) and Q{t) are Gaussian, in order to show the tighness of the sequence {e^^'''*'}t, it is sufficient 

to show the following proposition: 

Lemma 2 (Tightness): It exists e G [0, 1[ such that 



Proof: A direct computation leads to the following equation: 



(104) 



E 



g8Var[0(t)] _|_ g8Var[Sl(s)] _ 2g2Var[0(t)+Sl(s)] 

The first term of dlOSt can be estimated as 



(■ 



,4Var[n(t)] _ 4Var[0(s)] 



o((t-.f-) 



(t~s)2 + o((i-,s)2) 



and the second term as 

2g4Var[n(t)]+4Var[n(t)] _ g-2Var[f2(|t-s|)] 



„((i -,)"-). 



Lemma 3: If /i, . . . , /„, be n arbitrary intervals then 



E 



■ I ) J 



where K{Ii, ...,/„) reads 




(106) 



(107) 



otherwise, 



where p{t) is defined by Eq ( |99] l and P{In) is the set of all non ordinated partitions of two elements of X„. An element of 
P{In) can thus be written as {{ik-, jk)}k=i,...,7i/2- 

Proof: This result can be obtained from a direct computation relying upon Wick's Theorem [58] and Lemma [1] of 
Appendix |A] ■ 



Appendix B 

Taylor expansion of the moments of the measure 

The following proposition links the centered generalized moments of M with those of Vl. It will be used to prove limit 
theorems of sections IIII-CI and IIII-DI 

Proposition 12: Let n some positive integer The generalized centered moment of the log-normal MRM measure of the 
intervals /i, . . . , /„ admits the following Taylor series expansion when 0: 



E 



(M{h) 



1 



M(/„) 



1 



2"A"E 



\h\ \In\ 



0(A"), 



(108) 



A l^ll 7 V \in\ 
where f2 is the renormalized magnitude defined in Section IIII-BI 

Proof: Let us note that the right handside of Eq. (IIO8I 1 is given by Lemma [3] of Appendix |A] We are going to prove that 
the left handside of Eq. ( IIO8I 1 is equal to the same expression. 
Let us first indroduce the following random variables: 



\I3\ 



pour j = l,...,n. 



(109) 
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The centered generalized moment corresponding to intervals /i , . . . . /„ can be written as a linear combination of generalized 
moments: 

n 

E[(A/i-l)...(Af„-l)] = ^(-1)"-™ J2 IE[A4---M,„]. (110) 
From integral represention ( I121l i. we have: 



E[{M,^l)---{M^-l)]^±{-ir- Y: IPl-- - I P^e''^(^-\ (111) 

m=0 I™CI„/ ' / ' 

where S{Im) is the following symmetric sum; 

5(X„0= ^ X,^.,,, (112) 

ij<ik 

with 

It is possible to integrate previous expression as respect ui, . . . , u„: 



and changing the order of integration leads to: 



h 



m=0 I,TiCl„ 



In [32] it is shown that the generalized moments as functions of belong to C 3 (R) the class of ^ times continuously 
differentiable functions. The fc-th derivative of the generalized centered moment as respect to in A^ = reads: 

(^)V[,M.-i)..-(M,.-i)i|^^^^./ii.../iiii;Mr" y. (^M' <"« 

r T m=0 ImCin 

Let us consider some arbitrary integer j. One can regroup the terms under the integral in jl 161 ) 

n n — 1 

E (-1)""'" E ('5(1™))' = E (-1)""" E - u {j}))' (117) 

m=0 XmCl„ m=0 r„Cl„\{j} 

By noticing that the symmetric sum 5(X,„ U {j}) can be rewritten as 

5(X™U{j})=5(X™)+C(j,J„), (118) 

where C{j,Tm) is defined as 

C(i,X™)= E ^J'- (119) 

we can see that ^ 

(5(J™))' - {S{I,n U O'}))' = -C{j,I„.) Y—^--{C{jan.)y-\s{X,n)t'\ (120) 

This last relationship means that each term of the sum ( II 17b contains at least one factor like Xji^^ . Because j is arbitrarily 
fixed, the sets of factor indices of each term of the sum jl 17b must contain all indices 1, . . . ,rt. Therefore, if the derivative 
order k is smaller than n/2, the sum (II 17b must vanish. It results that the first non trivial order in the Taylor series in power 
of A^ of the centered generalized moment is at least ^^±1 if ^ est odd, and ^ if n is even. In this latter case, the first order 
terms in the Taylor series are proportionnal to X^Xi^j^ ■ ■ ■ Xi^^^^j^^^, where the set of indices . . . ,in/2^ jn/2} contains 

all the values 1, . . . , n. Such terms can only come from the expansion of s{in) ^ which leads to the expected result. ■ 

The following Lemma will be used to prove the limit theorem of Section IIII-EI 
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Lemma 4: For some arbitrary intervals Ii . . . , /„, the generalized moments of a log-normal MRM measure has the following 
integral representation: 



E 



M{h) M{La) 
\h\ \In\ 



dUr, 



where the function fx2 (ui, . . . , w„) is 



/a2(ui, . . . ,M„) = exp I 4A^ ^ p{uiUj] 

l<2<j<n 



(121) 



(122) 



where p{t) is defined by Eq 

Proof: This directly results from the fact that lim;^o+ E [Af;^T[0, t]] = ]E[AfT[0,t]] — t [14] and some simple algebra 
(see [32]). ' ■ 

For the sake of simplicity let us introduce the following set of indices: 



,ijn} Cin = {l,...,n}. 



(123) 



Appendix C 

Taylor expansion of the moments of the logarithm of the measure 

In this section we establish some results useful to prove limit theorems of Sections [III-DI and IIII-CI According to Proposition 
|5j all the moments of the MRM of negative orders are finite, consequently the moments of the logarithm of the MRM are also 
finite. 

Along the same line as in previous appendix, one can write an expansion for the generalized moment of the logarithm of a 
log-normal MRM measure. 

Proposition 13: Let n be a positive integer. The generalized centered moment of the logarithm of the MRM measure of 
intervals /i , . . . , /„ admits the following Taylor series expansion around = 0: 



E 



In 



Mjh) 



••In 



M(/„) 



= E 



Mjh 



- 1 



M{In 



- 1 



(124) 



where K{Ii, . . . , /„) are defined Eq. ( |107| l. Let us recall that K{Ii, . . . , /„) = if n is odd. 

The proof of this Proposition is postponed to the end of this section, it is based on the following Lemma |5] |6] and |7] : 
Lemma 5: Let < e < 1 and m be a positive integer. One has the following inequality: 

P[M ^ B^] < o(A^''"), (125) 

where the compact subset of R", i?^ is defined by: 

B^^ {x = {xi,...,Xn) eW; max (|xfc - 1|) < e|. (126) 

and o(A^™) depends on de e. 

Proof: Since A^ 0, then, without loss of generality, one can assume that the order 2m + 2 centered moment of the 
log-normal MRM measure exists (see Theorem |3]l. Thanks to Bienayme-Tchebychev inequality, one has: 



n n 



(127) 



Lemma 6: Let < e < 1. For all continuous function f{M) over the compact set B^ defined in (I126l l. the following 
inequality holds: 



E [/(A/)] - E [/(A/)|Af e B,] < v/E[/(Af)2]Jp[A// ^ B,] + ( sup |/(Af)| P[Af ^ B,] . 

Proof: From the law of total probabilities, it follows that 

E [/(Af)] - E [/(Af)|A/ € B,] = E [/(Af)|Af ^ B,] ¥[M ^B,]+E [/(A'/)|Af e B,] (P[Af e B,] - l) 

= E [/(Af)|Af ^ B,] ¥[M ^ B,]-E [f{M)\M e B,] F[M ^ B,] , 

where the first term can be bounded using Cauchy-Schwartz inequality 



(128) 



E [/(A/)|Af ^ B,] F[M ^ Be] = E [/(Af)l{M^Bj] < [fiM)^]^P[M ^ B, 



(129) 
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whereas the second term is bounded by the supremum over the compact set 

E [f{M)\M e B,] ¥[M (^B,]<( sup |/(A/)|)p[M ^ B,]. 



(130) 



Lemma 7: Let n be a positive integer In the compact set B^, with e < 1, one has the following identity of Taylor series 
expansions up to order n around = 0: 



E 



M e Be 



= E 



n(M,-i) 



M e B^ 



o(A") 



Proof: Thanks to the identity ln(.x) = a;— 1 — (x— 1 — ln(.-E)), one has 



n ln(A/,) - ^{M, - 1) = n - 1 - i^'h - 1 - ln(A^,))) - W^M, - 1) 

j=i j=i j=i j=i 



(131) 



(132) 



The expansion of the first product of r.h.s. leads to a linear combination of terms such as Hj^i (M^^ — 1— ln(A£ij )) Hj^fe+i (-^^^j ~ 
1) with \ <k <n which conditional expectation can be bounded using Cauchy-Schwartz inequality and the following inequality 
established in [32]: 



e - ln(l + e) 



<x- hi(l + .t) < - 



e + ln(l - e) 



, for all < e < 1 and x <E [— e, e] 



It results that 



(133) 



E 



< E 



i=i 



M e Be 



1/2 



E 



1/2 



M e B, 



_i=i 

Using ( I142l i. with n = 2 and n = 4, it is possible to remove the condition M E B, 



Yl (M,^. - 1)2 Af e Be 

1/2 

j=k+i 



M e Be 



1/2 



(134) 



E 



n(M,^-i-in(A/,j) n (^^^. -i; 

3 = 1 j=k+l 



M e Be 



(0(A4'=) + o(A''"))^^^(C'(A2"-2'=) + o(A2"))^/^ C'(A"+'=) = o(A"). (135) 



We are now ready to give the proof of Proposition [T3] 

Proof of Proposition 173} For simplicity purpose let us introduce the random variables: 



Mi 



mi, 
I/. I 



, for j = 1, . . . ,n. 



(136) 



The major difficulty of the proof relies in the fact that the Taylor series of ln(l + a;) around a: = converges in the interval 
(— 1, 1). Let < e < 1 and let us consider thevector M = (Afi, . . . , Af„) and the compact set Be defined in (1126b . Let us 
also define the two remaining parts i?'"(A) and -R'^(A) as: 

B'"(A) = E [In(Afi) • • • ln(Af„)] - E [in(Afi) • • • ln(A'f„) | Af e Be] , (137) 
i?^(A) = E [(Afi - 1) • • • (Af„ - 1)] - E [(Afi - 1) • • • (Af„ - l)|A'f e B,] . (138) 

By applying Lemma|6]to i?'"(A) and -R'^(A), it follows 



|i?'"(A)| < VIE [ln(A.fi)2 . . . hi(Af„)2]y^P[A/ ^ Be] + | ln(l - e)rP[Af ^ B,] , 
I ( A) I < VE [(Ml - 1)2 • • • (Af„ - l)2]y'p[A/ ^ Be] + e"P [A// ^ Be] . 



(139) 
(140) 
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Using the analytical expression of the g-order moment of M (cf Proposition [8]), one shows that the expectation term in ( 11391 ) 
is uniformely bounded in A^, for smaller than a (small enough) given Aq. In [32], it is shown that the the expectation term 
in (|140t is also uniformely bounded for A^ < A§. Using Lemma |5] with m = n, one gets the following inequalities; 

|i?'"(A)| < CA"+\ (141) 
|i?'(A)| < CA"+\ (142) 

where C depends on e. 

According to |2l from definitions (I137l l and ( I138l l and the bounds ( I141l l and (I142l i. we have: 

E [In(Afi) • • • ln(Af„)] - E [{Ah - 1) • • • (M„ - 1)] = o(A"). (143) 



Appendix D 

Proof of the consistency of the estimator dgOli in the high frequency regime 

In this section we provide of "simplified" proof of the consistency of the estimator defined in Eq. ( l90l l. More presisely, we 
prove the second part of theorem [TO] claiming that if n, n' are different integers such that Q < nr <T and < n'r < T, then 
in the high frequency asymptotic regime t — + 0, the estimator defined by ( |90] l is consistent and its variance decreases as 



(144) 



(with N = L/t). 

Proof: In the following we set 

Z,,,[n] =ln|J,l,,T(nT)|, (145) 

where Xi,t is the linear-wise process defined in Section Hl-EI It is defined by Eq. (|35l l which uses the measure t defined 
by ( [34] i. Let us recall that Xi,r{t) (resp. Mi,T{dt)) converges towards Xxit) (resp. MT{dt)) when I [14]. With no loss 
of generaUty, we choose I such that Z/r is an integer. Notice that according to Eq. (|35l l one has: 



{SrXi^T{nT)}„ = {e[n]J SrMi^T{nT)}„, (146) 

Law v 

where e[n] is a gaussian white noise independent. It results that 

Zi,r[n] - l\n5rMi,T{nr)+ln\e[n]l (147) 

Law Z 

and since we will consider below (empirical) co variances of Z[n] for lags n > 0, we will not longer take care of the terms 
ln|e[n]|. 

The empirical covariance function involving the cut-off scale I will de denoted as i?; ,-[n]: 

1 ^"'^ / 1 ^ V 

Rl,rM = T7 Zi^r[k]Zi^Ak + "] - T7 I] ^Lr W • (148) 



k=l \ k=l / 

and naturally 

RiAn,n'] = Ri^r[n] - Ri,r[n']. (149) 

A little algebra is sufficient to establish that 

Ys^r[Ri,r[n,n']] ^ N'^ ^ Cov[Zi,r[j]Ztr[j + n], Zur[k]Zi,r[k + n]] + €ov[Zi,r[]\Zi,r[] + n], Zi,r[k]Zur[k + n]] 

- 2Cov[Zi^r[i\ZiA3 + n], Zi^r[k]Zi^r[k + n']] (150) 

Therefore, in order to control the variance of the estimator one needs to control each term involved in the previous equation. 
Let us introduce some additional notations. Let [ioi^i] some time interval such that tg/l and ti/l are integers and such that 
1^1 — to\ > For all t — to + jl with < j < (ti — to)/l, we will decompose uji^T{t) — 7'(-4;,t(^)) as: 

^/(t) = 0[to,u]+4o.*ilW (151) 
where Oft^.ti] and S'^^^ independant gaussian process defined by 



OltaM] = l-pln^lAto + kl)) 
^ k ^ 

- ^v(Ai,T{to+jl)\f]Ai,T{to + kl) 

\ I, 
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Let us now consider two disjoint intervals [ioi^i] ™d [sqjSi] and let S the distance between the middle of each interval: 
6 — [sq + si — to — ti) /2. Then after some algebra it can be shown that there exists two spherical gaussian vector^ dj^^ (of 
dimension 3(ti — to)/0 and r|^^ (of dimension 3(si — sq)//) such that, \/{j,k) e [0, (ti — to)/'] x [0, {si — sq)/1], 5*'^^ tjO) 
is a j-dependent Hnear combination of the components of dj^^ while si](^) ^ fc-dependent linear combination of the 
components of r|^^ ^^j. Moreover, when S 3> sup(|si — so|, \ti — to|)^ one has 



Cov 

Cov 
Cov 
Cov 
Cov 
Cov 



4o.ti]W'<^[to,u] 
4o,ti]W'<^[to,ti] 



.52 

otherwise 
I 



if < fc < (ii - to)/' and 0<i< (si - so)// 



O 



if < fc < (ti - to)/' 



otherwise 

l 



(152) 



= if 0<z<(si-so)/' 

= otherwise 



Cov [0[t„,t,],Ois„,s,]] 

Let some integer n > 0, according to these notations, Zi^r[k]Zi,r[k + n] can be rewritten as: 



RA-) 

T 



ZlAk]Zl^r[k + n] - Op_i)^,(fc+„)r] + 0[(fc-l)r,(/c+«)r]/l(?'|(fc-l)r,(fe+n)r]) + (''[(fc- l)r, (fe+n)r] ) 



(153) 



where /i and /2 are two non-linear functions of the spherical noise (fc+n)T] ^^^^ shown to have a second 

moment that is bounded uniformely in I. If one uses the decomposition ( |153l l in expression dlSOt . each covariance term in 
( 11501 ) will give 9 terms that can be of 6 different forms: If one denotes Ij the interval [jr, (j + n)r] or [jr, {j + n')r]: 

(*i) iV-2Co40|^.,0,,/i(r^J] 

(zv) N-'Cov[Md\,)j2iri)] 
iv) N-^Cov[OIJ^{d'JA,h{ri)] 
(m) N-^Cov[Olj2ir'jJ] 

In order to prove the consistency of the estimator, we have to prove that the contribution of each term (i)-(vi) vanishes when 
N oo (after taking the limit I 0). We will just explain how to take care of the terms of type (iv). The other terms are 
dealt with in the same way. The terms of type (iv) are of the form 



(154) 



We need the following technical Proposition proved in [41] concerning the covariance of non-linear function of gaussian 
vectors: 

Proposition 14: Let A and B be two spherical gaussian vectors which dimensions are respectively p and q. Let us denote Cki 

the cross covariance between components: Cki — Cov [Ak, Bi] and p ~ Let / : IR and g : R*^ — > R 
be two non-linear functions such that: 

E[/(A)2] +E[5(B)2] <oo (155) 

then, if p < 1/2 we have: 



Cov[/(A),g(B)] < 2 (E [f{Af] E [g{-Bf]Y'% 



(156) 



Taking care of the term ( |154| l is a special case of the previous Lemma with p = 1 and q = \Ik\/l- Accordingly and using 
( 11521 ) one has (since \Ik \ ~ r) 



\ik\/i 



n=0 



I 



:)-0( 



■) 



(157) 



spherical gaussian vector is a vector made of independant standard normal variates 
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and therefore, there exists K < oo, such that, uniformely in I, 



Cov[Ol,Mr'jJ] < 



K 



\] - k\ 



(158) 



Let us now remark that in the double sum ( 1150b . since Zt — Inj^T-Xrl admits a finite four order moment, one has just to 
consider the case where (j, k) are such that \i — fc| > with v <1 because the contribution of terms \j — fc| < N'^ can be 
bounded as 



which converges to zero when — > oo. Hence, one can choose some > in order that sup(ri, n') ^ \i -~ k\. Under that 
condition, the term ( I154l i can be bounded, uniformely in / by K'^^j^ which vanishes in the Umit N +oo. 

The same kind of computation can be lead for each of terms like (ii)-(v) and we do not report the details here for the 
sake of concision. The only problem remains for (i) terms. But in that case, since the random variables O are gaussian wich 
covariance is nothing but RtHJ — k\], thanks to Wick's theorem the covariance of products of O can be expressed in terms of 
Rr and it can be shown, that the main remaining contribution can be bounded like: 



|fc|<Ar 

whith some constant the depends only on n and n'. Finally, by merging all contributions together we have proved that there 
exists a constant C that depends only on n and n' such that: 



(159) 




(160) 



Yar[Ri^r[n,n']] < C 



(161) 



and by taking the limit I one obtains (I144l l and the consistency of the estimator. 
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